# Browser Automation

cursor-tools
cursor-tools is a plugin that extends the Cursor code editor. By integrating with AI models such as Perplexity and Gemini, it offers codebase-aware context, automated browser operations, and GitHub integration. Its main advantage is improved development efficiency: developers can resolve complex issues quickly, working against both local and remote codebases. cursor-tools is positioned as an intelligent assistant for developers and suits scenarios that require efficient code management and automated testing. It is open source on GitHub and free to use.
Coding Assistant
63.8K
EasyWeb
EasyWeb is an open, AI-based platform for building and deploying intelligent agents that interact with web browsers. Its interface lets users quickly deploy AI agents for browser tasks such as travel planning, online shopping, and news gathering. Built on the OpenHands architecture, it processes multiple user requests in parallel and lets users switch between agents and large language models (LLMs) as needed. Its key advantages are simple deployment, ease of use, support for multiple task types, and fully open-source availability, which makes it a useful base for developers and researchers working on task automation.
Development Platform
55.5K
Project Mariner
Project Mariner is an early research prototype from Google DeepMind, built on the Gemini 2.0 model, that explores future forms of human-computer interaction, starting with web browsers. It can understand what is on the browser screen, including pixels and web elements such as text, code, images, and forms, and use that information to complete tasks. Technically, Project Mariner operates directly in the browser through a Chrome extension.
AI search
53.5K
Cerebellum
Cerebellum is a lightweight browser agent that achieves user-defined goals on web pages through keyboard and mouse actions. It models web browsing as navigation through a directed graph, using large language models (LLMs) to analyze page content and interactive elements and decide the next action. It is compatible with any Selenium-supported browser and can fill in forms from user-provided JSON data. The product is in beta and is free for developers and researchers.
Automated Workflow
57.4K
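
The Cerebellum entry describes browsing as navigation through a directed graph, with a model choosing the next action on each page. The following is a minimal sketch of that idea only, not Cerebellum's actual code or API. Every name here (`navigate`, `scripted_planner`, `EXAMPLE_SITE`) is hypothetical, and the planner is a scripted stand-in for an LLM call.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical site graph: each page (node) maps an action label (edge)
# to the page that action leads to.
Graph = Dict[str, Dict[str, str]]
# A planner receives (current page, available actions, goal) and returns one action.
Planner = Callable[[str, List[str], str], str]

EXAMPLE_SITE: Graph = {
    "home": {"click 'Search'": "search", "click 'About'": "about"},
    "search": {"type query and submit": "results"},
    "results": {"click first result": "product"},
    "about": {},
}


def scripted_planner(script: Dict[str, str]) -> Planner:
    """Build a planner that replays a fixed page -> action script.

    This stands in for an LLM that would inspect the page and pick an action.
    """
    def choose(page: str, actions: List[str], goal: str) -> str:
        # Fall back to the first available action when the script has no entry.
        return script.get(page, actions[0])
    return choose


def navigate(graph: Graph, start: str, goal: str, choose_action: Planner,
             max_steps: int = 10) -> Optional[List[Tuple[str, str]]]:
    """Follow planner-chosen edges from start; return the path, or None on failure."""
    page = start
    path: List[Tuple[str, str]] = []
    for _ in range(max_steps):
        if page == goal:
            return path
        actions = graph.get(page, {})
        if not actions:
            break  # dead end: no interactive elements to act on
        action = choose_action(page, list(actions), goal)
        if action not in actions:
            break  # planner proposed an action the page does not offer
        path.append((page, action))
        page = actions[action]
    return path if page == goal else None


if __name__ == "__main__":
    planner = scripted_planner({
        "home": "click 'Search'",
        "search": "type query and submit",
        "results": "click first result",
    })
    print(navigate(EXAMPLE_SITE, "home", "product", planner))
```

In a real agent the graph would be discovered page by page rather than known up front, and the step limit guards against a planner that loops.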
Sentient
Sentient is a framework/SDK that lets developers build intelligent agents capable of controlling a browser in just 3 lines of code. It uses current AI models to perform complex web interactions and automation tasks with simple code. Sentient supports various AI model providers, including OpenAI and Together AI, so solutions can be customized to user-specific requirements.
AI Development Assistant
51.3K
English Picks
MrScraper
MrScraper is a versatile web scraping tool that lets users extract data from websites without coding knowledge. It extracts the desired information automatically, handles large volumes of requests, and includes browser automation capabilities. Users can create scrapers, customize selectors, and configure scraping tasks to fit their needs. MrScraper is described as trusted by world-leading companies, with enterprise-level performance capable of handling millions of data points.
Data Analysis
61.3K
Fresh Picks
Crawlee for Python
Crawlee is a Python library for building reliable web crawlers. Developed by experienced web crawling professionals, it's used daily to crawl millions of pages. Crawlee supports JavaScript rendering, allowing you to easily switch to browser crawling without rewriting code. It also offers automatic proxy rotation and management, intelligently managing and cycling through proxies based on system resources and discarding those frequently encountering timeouts or network errors.
Development & Tools
56.9K
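
The Crawlee entry mentions rotating proxies and discarding those that frequently hit timeouts or network errors. The sketch below illustrates that general technique in plain Python. It is not Crawlee's API: the `ProxyRotator` class, its method names, and the `max_failures` threshold are all assumptions made for illustration.

```python
from collections import deque
from typing import Deque, Dict, Iterable


class ProxyRotator:
    """Hypothetical round-robin proxy pool that drops repeatedly failing proxies."""

    def __init__(self, proxies: Iterable[str], max_failures: int = 3) -> None:
        self._pool: Deque[str] = deque(proxies)
        # Consecutive failure count per proxy.
        self._failures: Dict[str, int] = {p: 0 for p in self._pool}
        self._max_failures = max_failures

    def next_proxy(self) -> str:
        """Return the next healthy proxy and move it to the back of the queue."""
        if not self._pool:
            raise RuntimeError("no healthy proxies left")
        proxy = self._pool[0]
        self._pool.rotate(-1)
        return proxy

    def report_failure(self, proxy: str) -> None:
        """Record a timeout or network error; discard the proxy at the threshold."""
        if proxy not in self._failures:
            return
        self._failures[proxy] += 1
        if self._failures[proxy] >= self._max_failures and proxy in self._pool:
            self._pool.remove(proxy)

    def report_success(self, proxy: str) -> None:
        """Reset the failure count after a successful request."""
        if proxy in self._failures:
            self._failures[proxy] = 0

    def healthy(self) -> list:
        """Return the proxies still in rotation, in their current order."""
        return list(self._pool)
```

A caller would request `next_proxy()` before each fetch and report the outcome afterwards, so that only consecutive failures count toward removal.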
AIEmploye
AIEmploye is a GPT-4 vision-powered browser automation tool that automates the transfer of data from emails to CRM/ERP systems. This tool utilizes human-like intelligence to understand emails, receipts, invoices, and more, helping users save a significant amount of time each week.
Automated Workflow
65.1K
BrowseGPT
BrowseGPT is an AI browser automation plugin that uses OpenAI's GPT-3 model to process web pages and execute commands such as clicking, typing text, and navigating. It can make occasional errors, but it shows the reasoning behind each decision so you can help correct them. Warning: this is an experimental plugin. Use it with caution and avoid pages that contain personal information or where mistakes could have serious consequences.
Automated Workflow
76.7K
Featured AI Tools
Flow AI
Flow is an AI-driven filmmaking tool for creators that uses Google DeepMind's models to produce movie clips, scenes, and stories. Users can bring their own assets or generate content within Flow. It is offered through the Google AI Pro and Google AI Ultra plans, which provide different feature sets for different user needs.
Video Production
42.8K
NoCode
NoCode is a platform that requires no programming experience: users describe an idea in natural language and the platform generates an application from it, lowering the barrier to development. It provides real-time previews and one-click deployment, which makes it well suited to non-technical users.
Development Platform
44.7K
ListenHub
ListenHub is a lightweight AI podcast generation tool that supports both Chinese and English. Based on cutting-edge AI technology, it can quickly generate podcast content of interest to users. Its main advantages include natural dialogue and ultra-realistic voice effects, allowing users to enjoy high-quality auditory experiences anytime and anywhere. ListenHub not only improves the speed of content generation but also offers compatibility with mobile devices, making it convenient for users to use in different settings. The product is positioned as an efficient information acquisition tool, suitable for the needs of a wide range of listeners.
AI
42.2K
MiniMax Agent
MiniMax Agent is an AI assistant built on multimodal technology. Its MCP-based multi-agent collaboration lets teams of AI agents work together on complex problems. It provides instant answers, visual analysis, and voice interaction, and is claimed to increase productivity tenfold.
Multimodal technology
43.1K
Chinese Picks
Tencent Hunyuan Image 2.0
Tencent Hunyuan Image 2.0 is Tencent's most recently released AI image generation model, with significantly improved generation speed and image quality. Using a codec with a very high compression ratio and a new diffusion architecture, it can generate images in milliseconds, avoiding the wait associated with traditional generation. The model also improves realism and detail by combining reinforcement learning with human aesthetic knowledge, which suits professional users such as designers and creators.
Image Generation
42.2K
OpenMemory MCP
OpenMemory is an open-source personal memory layer that provides private, portable memory management for large language models (LLMs). It ensures users have full control over their data, maintaining its security when building AI applications. This project supports Docker, Python, and Node.js, making it suitable for developers seeking personalized AI experiences. OpenMemory is particularly suited for users who wish to use AI without revealing personal information.
open source
42.8K
FastVLM
FastVLM is an efficient visual encoding model designed for vision language models. It uses the FastViTHD hybrid visual encoder to reduce both the time needed to encode high-resolution images and the number of output tokens, improving speed while maintaining accuracy. FastVLM is aimed at developers who need visual language processing across a range of scenarios, and it performs particularly well on mobile devices that require fast responses.
Image Processing
41.4K
Chinese Picks
LiblibAI
LiblibAI is a leading Chinese AI creative platform offering powerful AI creative tools to help creators bring their imagination to life. The platform provides a vast library of free AI creative models, allowing users to search and utilize these models for image, text, and audio creations. Users can also train their own AI models on the platform. Focused on the diverse needs of creators, LiblibAI is committed to creating inclusive conditions and serving the creative industry, ensuring that everyone can enjoy the joy of creation.
AI Model
6.9M
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025 AIbase